Language Engineering System for Automatic Conversion of English Cyber Data into Urdu Websites

نویسندگان

  • Imran Sarwar Bajwa
  • M. Abbas Choudhary
چکیده

English is one of the most widely spoken languages in the world these days. Most of the commercial websites are also being designed in English Language. Modern software engineering trends supports better interfaces for effective Human Computer Interaction (HCI). One of the major HCI requirements is to provide data in human readable format. All people can not get benefits of cyber information which is usually available in English language. This paper presents a natural language engineering based framework for generating the Urdu language text from English language. A rule based algorithm is designed to read the input text, understand it and convert into another natural language. This research was conducted to incorporate the vigorous ability of this algorithm to read input English language text from websites and convert it into desired output natural language as Urdu. The designed system uses extensive markup language to represent the input text into a standard format. As XML also has a valuable ability to use Unicode for text representation, hence the text represented in the Unicode can effortlessly be converted into Urdu language. User has to specify the URL of the desired websites and the designed system translates that input webpage from English language into Urdu language. This is a very effective and efficient way of converting an English website into Urdu website with a single click of the mouse within no time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Letter-To-Sound Conversion For Urdu Text-To-Speech System

Urdu is spoken by more than 100 million people across a score countries and is the national language of Pakistan (http://www. ethnologue.com). There is a great need for developing a text-to-speech system for Urdu because this population has low literacy rate and therefore speech interface would greatly assist in providing them access to information. One of the significant parts of a text-to-spe...

متن کامل

AGHAZ: An Expert System Based approach for the Translation of English to Urdu

–Machine Translation (MT ) of English text to its Urdu equivalent is a difficult challenge. Lot of attempts has been made, but a few limited solutions are provided till now. We present a direct approach, using an expert system to translate English text into its equivalent Urdu, using The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1) Range: 0600–06FF. The expert system works with a knowledg...

متن کامل

Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, urdu, and Arabic

In this work, Portuguese, Polish, English, Urdu, and Arabic automatic speech recognition evaluation systems developed by the RWTH Aachen University are presented. Our LVCSR systems focus on various domains like broadcast news, spontaneous speech, and podcasts. All these systems but Urdu are used for Euronews and Skynews evaluations as part of the EUBridge project. Our previously developed LVCSR...

متن کامل

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

A Framework for Urdu Language Translation using LESSA

Internet is one of the major sources of information for the person belonging to almost all the fields of life. Major language that is used to publish information on internet is language. This thing becomes a problem in a country like Pakistan, where Urdu is the national language. Only 10% of Pakistan mass can understand English. The reason is millions of people are deprived of precious informat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012